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(57) The present invention leverages interrelation- 
ships of a relational schema by utilizing simplified anal- 
ysis heuristics to define and build dimensional models 
automatically. A user can also participate interactively 
in both the definition and building processes to influence 
an outcome. The relational schema is comprised of re- 



lational databases and the like. The dimensional models 
are comprised of OLAP objects and the like such as RO- 
LAP and MOLAP objects. A means is also provided for 
allowing a user to initiate via a single user action, such 
as a single computer mouse click, defining a dimension- 
al model, bmiding a dimensional modeL ano^orboth de- 
fining and building a dimensional modeL 
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Description 
TECHNICAL FIELD 

[0001] The present invention relates generally to data 5 
analysis , and more particularly to systems and methods 
for automatically generating and building a dimensional 
model based on a relational schema utilizing simplified 
analysis heuristics. 

BACKGROUND OF THE INVENTION 

[0002J Modem society has come to depend heavily on 
computers and computer technology. It is especially 
prevalent in the business arena where companies com- 
pete fiercely for customers and product sales. A com- 
pany with just-in-tirne Inventory and well focused adver- 
tising strategies generaily produces a product cheaper 
and delivers it faster to a customer than a competitor. 
Computer technofogy makes this type of business edge 
possible by networking businesses, information, and 
customers together. Although originally computers com- 
municated to other computers via networks that only 
consisted of local area networks {LANs) : the advent of 
the Internet has allowed virtually everyone with a com- 
puter to participate in a global network, This allows small 
businesses to be competitive with larger businesses 
without having to finance and build a network structure. 
[0003] As computing and networking technologies 
become more robust, secure and reliable, more con- 
sumers, wholesalers, retailers, entrepreneurs, educa- 
tional institutions and the like are shifting paradigms and 
employing networks, such as the Internet, to perform 
business instead of the traditional means. Many busi- 
nesses are now providing web sites and on-line servic- 
es. For example, today a consumer can access his/her 
account via the Internet and perform a growing number 
of available transactions such as balance inquiries, 
funds transfers and bill payment, 

[0004] Typically businesses gather customer and 
business information and store it in large entities known 
as databases. These are collections of information or- 
ganized so that a computer program can quickly select 
desired pieces of data. The databases can be astronom- 
ical in size and expand exponentially as technology al- 
lows more and more data to be collected. Generally, da- 
tabases are organized by fields, records, and files. A 
field is a single piece of information: a record is one com- 
plete set of fields; and a file is a coHeeiion of records. A 
database management system (DBMS) is utilized to ac- 
cess information from the database. The DBMS is a 
group of programs that enables a user to enter struc- 
ture, and seiect data in a database. 
[0005 J As often occurs in business, there is always a 
push to increase profits and lower operating costs. 
Thus, Business intelligence (Bl) solutions were devel- 
oped to aid in accessing information from large databas- 
es. Most businesses in recent times have migrated to 



relational type databases where data is interrelated. Da- 
ta warehouses were developed to store tactical informa- 
tion to answerthe "who" and "what" questions about the 
stored data related to previous events. However, this 
proved limiting due to the fact that data warehouses only 
have the capability of retneving historical data. There- 
fore, on-iine analytical processing (OLAP) systems 
were developed to not oniy answer the "who" and 
"what", but aiso the "what if" and "why" of the data. 
OLAP systems are multidimensional views of aggregate 
data that aNow analysts, business managers, and exec- 
utives and the like to gain insight into the information 
through a quick, reHable, interactive process. A mam 
component of OLAP is an OLAP server, which is situat- 
ed between a client and a DBMS. The OLAP server un- 
derstands how data is organized in the database and 
has special functions for analyzing the data. 
[0006] Analysis tools, including OLAP tools, help to 
reduce the access times to extreme amounts of data. 
By utilizing these tools, a usercan ask general questions 
or "queries" about the data rather than retrieve all the 
data verbatim. Thus, "data about data" or metadata 
helps expedite the query process and reduce the re- 
quired network bandwidth. Dimensional models, such 
as OLAP objects, piay an important role in this type of 
analysis. The dimensional models are constructed or 
"built" from the data in the database. As is typical in most 
data analysis systems, the ending data supplied to a us- 
er depends heavily on the integrity of the dimensional 
model it is based upon. Thus, it is increasingly important 
that the dimensional model be built to accurately reflect 
information derived from the data in the database. 
[0007] it stands to reason that when a database size 
increases, the dimensional model can also increase in 
size. This also increases the complexity of a required 
dimensional model. A user attempting to build the model 
will find a continuing challenge to ensure that it is con- 
structed properly, requiring a great deal of time and ef- 
fort. Add to this, the increasing complexity of a model 
having greater than three dimensions, numerous 
amounts of measures and intricate timing and the like, 
and it becomes a great undertaking. When a model is 
built, it must always contain information that can be em- 
ployed to extract an answer to an end user's query. 
Thus, the structuring of the dimensional mode] or "cube" 
is equally important. Different businesses require an- 
swers to different queries even though the database in- 
formation may be simNar. Different departments in the 
same business may also require different answers than 
other departments. 

[0008] For a business to remain competitive, it must 
always strive to perform better than its competition. Uti- 
lizing smarter and more intuitive business solutions aug- 
ment this performance. Dimensional modeling, like 
OLAP objects, is key tc aiding businesses in their battle 
to be the best. It allows valuable and ^'hidden" informa- 
tion to be extracted from data stores which are not avail- 
able without this analysis technique. As more and more 
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businesses discover their bidden data, dimensional 
modeling will prove critical in the success of a business 
in a tight market place. 

SUMMARY OF THE INVENTION 5 

[0009] The following presents a simplified summary 
of the invention in order to provide a basic understand- 
ing of some aspects of the invention. This summary rs 
not an extensive overview of the invention, it is not in- to 
tended to identify key/critical elements of the invention 
or to delineate the scope of the invention. Its sole pur- 
pose is to present some concepts of the invention In a 
simplified form as a prefude to the more detailed de- 
scription that is presented later. 15 
[0010] The present invention relates generally to data 
analysis, and more particularly to automatically gener- 
ating and building dimensional models based on a rela- 
tional schema utilizing simplified analysis heuristics. 
Analysis heuristics are leveraged to automatically ere- 20 
ate definitions of dimensional models and also to build 
dimensional models from the definitions. By automati- 
cally converting database information into a dimension- 
al modei such as OLAP objects like MOLAR (Multidi- 
mensional OLAP) and/or ROLAP (Relational OLAP) ob~ 25 
jects and the Sike : users gam an ability to construct the 
modef with one click of a computer mouse and/or inter- 
actively influence the building of the model during its 
construction. The present invention also allows for au- 
tomatically defining a modef and allowing a user to 30 
change a definition interactively before building the 
model, providing a faster and a more user-friendiy meth- 
od and system of developing objects for use with OLAP 
toofs. 

[001 1] The present invention also facilitates data 35 
analysis by reducing the required level of skill necessary 
to define a dimensional model and also reduces the 
amount of human error in the model definition. While re- 
lieving the tedrousness of defining the model the 
present invention also allows for user interaction so that *o 
advanced users can benefit from the automated fea- 
tures while still being abfeto influence the outcome, per- 
mitting a wider experience base of users without limiting 
expert users. The present invention also provides a mul- 
ti-phased approached to allow the definition of the di- *5 
mensionai model to be controlled independently of the 
building of the dimensional model, in this fashion, all or 
part of a process can be automated and/or interactively 
influenced via a user interface, This flexibility drastically 
decreases the development time of a dimensional mod- so 
el and ; at the same time, enables interactivity, allowing 
a userto quickly build a model and adapt it as necessary, 
maximizing user-friendliness, increasing model devel- 
opment speed, and providing reliable, high integrity 
OLAP objects. 55 
[0012] To the accomplishment of the foregoing and re- 
lated ends : certain illustrative aspects of the mventron 
are described herein in connec^on with the following de- 



scription and the annexed drawings. These aspects are 
indicative, however, of but a few of the various ways in 
whtch the principles of the invention may be employed 
and the present invention ts intended to include all such 
aspects and their equivalents. Other advantages and 
novel features of the invention may become apparent 
from the following detailed description of the invention 
when considered in conjunction with the drawings, 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] 

FIG. 1 is a block diagram of a data analysis system 
in accordance with an aspect of the present inven- 
tion. 

FIG. 2 is a block diagram of phasing of a database 
analysis system in accordance with an aspect of the 
present invention, r 

FIG. 3 is a flow diagram of a method of constructing 
a dimensional model in accordance wrth an aspect 
of the present invention. 

FIG. 4 is a flow diagram of a method of defining a 
dimensional model in accordance with an aspect of 
the present invention. 

FIG. 5 is a flow diagram of a method of building a 
dimensional model rn accordance with an aspect of 
the present invention. 

FIG, 6 is a table of a software process of interfacing 
with a user in accordance with an aspect of the 
present invention. 

FIG. 7 is a screen shot of a welcome user interface 
in accordance with an aspect of the present inven- 
tion, y 

FIG. S is a screen shot of a cube definition user in- 
terface in accordance with an aspect of the present 
Invention. 

FIG. 9 is a screen shot of a data source selection 
user mterface in accordance with an aspect of the 
present invention. 

FIG, 10 is a screen shot of a table type detection 
user interface m accordance with an aspect of the 
present invention. 

FIG. 11 is a screen shot of a table type selection 
user interface in accordance with an aspect of the 
present invention, 

FIG. 12 is a screen shot of an existing dimensions 
user interface in accordance with an SLspect of the 
present Invention. 

FIG . 1 3 is a screen shot of a measure selection user 
interface in accordance with an aspect of the 
present invention. 

FIG. 14 is a screen shot of a dimension hierarchies 
creation user interface in accordance with an as- 
pect of the present invention. 

FIG. 15 is a screen shot of a time period definition 
user interface in accordance with an aspect of the 
present invention. 
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FIG. 16 is a screen shot of a newiy -created dimen- 
sions user interface in accordance with an aspect 
of the present invention. 

FIG. 17 is a screen shot of a finishing a cube crea- 
tion user interface in acccrdance with an aspect of 
the present invention. 

FIG. 18 is a screen shot of a new manual measure 
definition user interface in accordance with an as- 
pect of the present invention. 
FIG. 19 is a screen shot of manual existing dimen- 
sions user interface in accordance wrth an aspect 
of the present invention. 

FIG. 20 is a screen shot of a new manual dimen- 
sions definition user interface in accordance with an 
aspect of the present invention. 
FIG. 21 is a screen shot of a manual time period 
definition user interface in accordance with an as- 
pect of the present invention. 

FIG. 22 illustrates an example operating environ- 
ment in which the present invention can function. 
FIG. 23 illustrates another example operating envi- 
ronment in which the present invention can func- 
tion. 

DETAILED DESCRIPTION OF THE INVENTION 

[0014] The present invention is now described with 
reference to the drawings : wherein like reference nu- 
merals are used to refer to like elements throughout. In 
the following description, for purposes of explanation, 
numerous specific details are set forth in order to pro- 
vide a thorough understanding of the present invention, 
It may be evident, however that the present invention 
may be practiced without these specific details. In other 
instances, well-known structures and devices are 
shown in biock diagram form in order to facilitate de- 
scribing the present invention 

[0015] As used in this application, the term ^compo- 
nent" is intended to refer to a computer- related entity, 
either hardware, a combination of hardware and soft- 
ware, software, or software in execution. For example ; 
a component may be ; but is not limited to being, a proc- 
ess running on a processor, a processor, an object, an 
executable, a thread of execution, a program, and/or a 
computer, By way of illustration, both an application run- 
ning on a server and the server can be a computer com- 
ponent. One or more components may reside within a 
process and/ or thread of execution and a component 
may be localized on one computer and/ or distributed be- 
tween two or more computers. A "thread" is the entity 
within a process that the operating system kernel sched- 
ules for execution. As is well known in the art, each 
thread has an associated "context' 5 which is the volatiie 
data associated with the execution of the thread. A 
thread's context includes the contents of system regis- 
ters and the virtual address belonging to the thread's 
process. Thus, the actual data comprising a thread's 
context varies as it executes. 



[0016] In order for an OLAP system to operate, it 
needs a dimensional object or "model" from which to 
draw information from. The dimensional object is gen- 
erally derived from a database and can contain more 
5 than three dimensions. Typically, dimensional objects 
are referred to as "cubes." This provides a simple illus- 
tration when referring to a dimensional object. The 
OLAP system retrieves information from the dimension- 
al objects when a query from a user is submitted. Thus. 

10 success of the OLAP system in replying to queries 
weighs heavily upon having a good, well-structured di- 
mensional object or model. A poorly constructed model 
can slow query response time and/or not allow the 
OLAP system to have the capability to respond correctly 

1 $ to the query. Generally speaking, a user constructing a 
model must be very knowledgeable about both the da- 
tabase being employed and the types of queries the 
OLAP system will be called upon to answer. This can 
be a long and tedious effort for even a highly skilled user. 

20 [0017] The present invention allows a dimensional 
model to be defined and constructed with a single user 
action, it also allows a user to interact with the process 
to influence the outcome of the definition and/ or the 
build. Thus, a lesser experienced user can have a di- 

25 mensional model or "cube" defined and built with a sin- 
gle mouse click. An advanced user, however, still has 
the capability to influence the defining of the cube and/ 
or the buildmg of the cube on an interactive level. This 
allows the cube to be tailored as the user sees fit. A typ- 

30 ical business employing the present invention saves 
time and money developing an OLAP system that meets 
its business objectives due to the present invention's 
ease of use and flexibility in skill level required to oper- 
ate it. This reduces costs and complexities of providing 

35 business information allowing the possibility of eliminat- 
ing a need to hire experts to construct a proper dimen- 
sional model. The dimensional model can then be 
tweaked as necessary to formulate a data analysis sys- 
tem that meets the business needs of the user 

40 [0018] Typically, databases are relational in nature 
meaning that the data has Hnks to other data that resides 
in the database. The present invention employs simpli- 
fied analysis heuristics to exploit these links from rela- 
tional schema such as a relational database. In this 

45 manner, a cube or dimensional model definition is con- 
structed. Once the definition is completed, the present 
invention builds a cube based on the cube definition. 
Both phases can be done automatically without any user 
intervention. However it is aiso possible for the user to 

50 interact with both phases of the present invention, This 
provides both a "one click" solution (e.g., one click of a 
computer mouse) and an interactive solution for in- 
creased flexibility. 

[0019] In FIG. 1, a block diagram of a data analysis 
55 system 1 00 in accordance with an aspect of the present 
invention is illustrated. The data analysts system 100 is 
comprised of a relational schema 102, such as a rela- 
tional database and the like, a data analysis component 
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104 : a dimensional model 106 or cube, such as an 
OLAP object and the like, and a user interface 1 08. The 
data analysis component 104 processes data from the 
relational schema 1 02 to create the dimensional object 
1 06. This is accomplished by first defining a dimensional 
model and then building the dimensional model 106 
from that definition. The user interface 1 OS ailows for a 
user to interact with the processing taking place in the 
data analysis component 104, This allows the user to 
redefine a dimensional model definition and/or influence 
how the dimensional model 106 is constructed. It atso 
allows a user to simply perform a single user action, 
such as a computer mouse click and the like, to initiate 
the data analysis component 1 04 to automatically define 
the dimensional model 106 and/or to construct the di- 
mensional model 106. Thus, a lesser experienced user 
can utiiizethe present invention with a minimal amount 
of effort. Likewise, an expert user can still interact with 
the present invention and influence the process as they 
see fit. The relational schema 102 is comprised of data 
that has interrelations with other data found in the rela- 
tional schema 102. It is these relations that are lever- 
aged to formulate a dimensional model using simplified 
analysis heuristics. 

[0020] Relational databases store data in tables that 
are two dimensional. The tables have rows (records or 
objects) and columns (fields or attributes). Data items 
at an intersection of a row and a column are called a cell 
and consist of attribute values. Multiple values are not 
stored in a single cell. Relational database tables are 
"normalized" so data is not repeated more often than is 
necessary, The table columns depend on a primary key ; 
pk> (a unique value in the column) to identify the column. 
Once a specific column is identified, data from one or 
more rows associated with that column may be obtained 
or changed Foreign keys, fk, are primary keys to infor- 
mation in other tabies that relate to the information as- 
sociated with a table having the primary key of concern. 
[0021] The relationships between several entities 
such as primary keys and foreign keys can be mapped 
to express cardinality. For binary relationship sets be- 
tween entity sets, the mapping cardinality can be one- 
to-one, one-to-many, many-to-one, and many-to-many. 
The relationships between a primary key, pk f (to repre- 
sent one of the "many") and a foreign key, fk, can be 
expressed as fkfpk pairs. Graphically, the reiationship 
can be drawn as an "arc" between the primary key and 
the foreign key. Direction a I Ety can be indicated by utiliz- 
ing arrows on the arc to show whether the reiationship 
directionality rs l 'm H or "eut ,: . 

[0022} Likewise, dimensionai models have character- 
istics or parameters that help define their structure, such 
as a tuple. The tuple is used to define a slice of data 
from a cube, it is composed of an ordered collection of 
one member from one or more dimensions. The tuple Is 
used to identify specific sections of multidimensional da- 
ta from the cube. It can be composed of one member 
from each dimension in a cube to completely describe 



a cell value. 

[0023] Turning to FIG. 2, a block diagram of phasing 
200 of a database analysis system in accordance with 
an aspect of the present invention is shown, in one as- 
5 pectof the present invention, processing is broken down 
into two phases. Thus, the phasing 200 is comprised of 
a label phase 202 and a dimensional model structure 
phase 204. The labeling phase 202 is comprised of 
processing that utilizes simplified analysis heuristics to 

10 define a dimensional model. This includes, but is not lim- 
ited to, analyzing interrelations between data in a rela- 
tional schema. For example, an associated graph of the 
relational schema is defined as being a "tuple (K A/)", 
where V is a set of vertexes for every table found in the 

f£ schema, A is a set of arcs for every relationship in the 
schema, and fisa function where f(v1, v2) - a1 2 (where 
v represents a member of set V and a represents a 
member of set A)., if and only if there is a relationship 
between tables associated with v1 and v2 (named t1 

20 and £2, respectively) and the relationship is t1:t2= n:1, 
where n represents an integer from one to infinity. A "la- 
bel" is then defined for each structure, such as a tabfe L 
based on characteristics of an arc from pair of foreign 
key/primary key (fk/pk) connections from the associated 

25 structures (e.g... tables). Details of labeling are de- 
scribed infra. 

[0024J In another instance of the present invention, 
the dimensional model structure phase 204 is com- 
prised of building a dimensional model as defined during 

30 the label phase 202. Thus, the present invention can 
build a cube (dimensional model) automatically based 
on this definition. However, in other aspects of the 
present invention, a user can intercede and augment 
and/or change the definition before the cube is com 

35 structed. This allows the userto interact with the phasing 
200 as much or as little as is needed and/ or desired. 
The actual details of constructing a dimensional object 
are discussed infra. 

[0025] In yet another instance of the present inven- 

40 tion, a data analysis system is comprised of a means for 
utilizing simplified analysis heuristics for defining a di- 
mensional model based on data interrelations from a re- 
lational data schema, means for building the dimension- 
al model defined by the automated structure labeling 

4$ system, means to allow a user to interact with the auto- 
mated structure labeling system and/or the automated 
model building system, and means for allowing a user 
to initiate with a single user action the automated struc- 
ture labeHng system and/ or the automated model buiid- 

$o jng system. 

[0026] in view of the exemplary systems shown and 
described above, methodologies that may be imple- 
mented in accordance with the present invention will be 
better appreciated with reference to the flow charts of 

55 FiGs. 3-5. While, for purposes of simplicity of explana- 
tion, the methodologies are shown and described as a 
series of blocks, it is to be understood and appreciated 
that the present invention is not limited by the order of 
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the blocks, as some blocks may, in accordance with the 
present invention, occur in different orders and/or con- 
currently with other blocks from that shown and de- 
scribed herein Moreover, not ail illustrated blocks may 
be required to implement the methodologies in accord- 
ance with the present invention. 

[0027] The invention may be described in the general 
context of computer-executable instructions, such as 
program modules, executed by one or more compo- 
nents. Generally, program modules include routines, 
programs , objects, data structures, etc. that perform 
particular tasks or implement particular abstract data 
types. Typically the functionality of me program modules 
may be combined or distributed as desired in various 
embodiments. 

[0028] To allow great flexibility in the present inven- 
tion, user interactions can be incorporated to influence 
a resulting dimensional model. In FIG. 3. a flow diagram 
of a method 300 of constructing a dimensional model in 
accordance with an aspect of the present invention is 
illustrated. The method 300 starts 302 by labeling com- 
ponents of a relational schema 304 : such as tables in a 
relational database and the like. A determination is 
made as to whether any user input is available regarding 
the labeling of the components 306. If user inputs are 
available, the user inputs are incorporated 308 and a 
cube structure is defined 31 0. If , however inputs are not 
available, the cube structure is defined 31 0 from the la- 
bels of the components without any user input, Once the 
cube structure is defined 310, a determination is made 
as to whether any user inputs are available relating to 
the cube's definition 312, If user inputs are available, 
they are incorporated into the cube's definition 314 and 
the cube is built 31 6, ending the flow 31 8, If no user in- 
puts relative to the cube's definition are available 312, 
the cube is built 316 using the cube definition without 
any user inputs, ending the flow 318. Although this as- 
pect of the present invention is illustrated with discrete 
points at which user inputs are accepted, other aspects 
of the present invention can accept user inputs at any 
point during processing for, up to and including, a 100% 
interactive capability. Likewise, the process can be to- 
tally automated with no interactive participation by a us- 
er. 

[0029] In order to define a dimensional model accord- 
ing to one aspect of the present invention, structures, 
such as tables and the Nke ; pertinent to a database must 
be labeled to aid in building a dimensional model. Re- 
ferring to FIG. 4, a flow diagram of a method 400 of de- 
fining a dimensional model in accordance with an aspect 
of the present invention is depicted The method 400 
starts 402 by identifying partition clusters 404. All vor- 
texes that have only "out" arcs are reviewed to identify 
those structures that have equivalent labeling {i.e., 
same pk, same cardinalities of columns with measure 
types not used in an fk/pkpavr). Partitions are grouped 
into detail clusters and are analyzed as a singie vertex. 
Only when actually building a cube are appropriate par- 



titions created. Afi structures with only "our arcs are 
then labeled as strong fact 406. All structures with two 
or more "m" arcs are labeled as strong dimensions 408 . 
AM structures with one "in" arc from a dimension are la- 

5 be led as strong dimensions 41 0, Ali structures with one 
"in" arc from a fact table and zero or one "out" arc are 
labeled as strong dimensions 41 2. Ail structures with ex- 
actly one "in" arc from a fact structure and two or more 
arcs going into a cluster that contains dimensions la- 

10 beted as starting from other fact structures are labeled 
as both dimensions and facts 414. All structures with 
one "in" arc from a fact table are labeled as dimensions 
41 6, ending the fiow 418. The structures are comprised 
of tables and the like as found in relational schemas 

f5 such as refationai databases and the fike. Table types 
can include fact tabies, degenerate tables, and dimen- 
sion tables. Those skilled in the art can appreciate that 
other aspects of the present invention can include meth- 
ods having more or fewer steps as noted supra and still 

so be within The scope of the present invention. It should 
also be noted that a user can interact at any point during 
this process . 

[0030] It can also be appreciated that in other instanc- 
es of the present invention, a user can edit and/or aug- 

£5 ment a structure labeling schema. This allows the user 
an advanced level of interaction in which they can influ- 
ence rules that govern the labeling of structures during 
a definition phase of constructing a dimensional model. 
In this manner, the labeling schema can be tweaked to 

30 provide a tuned process for automatically constructing 
dimensional models. It is also possible in yet another 
instance of the present invention to permit a user to de- 
termine a user interaction level to determine an amount 
of interactivity permitted during a dimensional model 

3$ construction. Such levels can include, but are not limited 
to, total automation, limited interaction, and/ or full inter- 
action (e.g.. manual) type modes. These modes or lev- 
els are not limited to only the labeling and definition 
processing, but can also be applied to building the di~ 

40 mensional model described infra, 

[0031] Typically, after a definition has been derived 
from a database and any user inputs have been ac- 
counted for another aspect of the present invention 
builds a dimensional modei based on the derived defi- 
es niliom Looking at FIG. 5, a flow diagram of a method 
500 of building a dimensional mode! in accordance with 
an aspect of the present invention is shown . The method 
500 starts 502 with creating a cube with as many details 
as there are fact clusters 504. A determination is then 

so made as to whether multiple partitions exist 506. If oniy 
one partition exists and if every detail holds a name of 
a fact table it hosts , the name utilized is the name of the 
first and only partition 508. If multiple partitions exist and 
if every detail holds a name of a fact table It hosts, the 

55 name is a variable percentage length of a common sub* 
string of the fact tabie name 510, A determination is then 
made as to whether numeric columns of a fact table exist 
512, If no numeric columns exist, a count measure is 
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created with an expression of "coimtO" 514, if howev- 
er numeric columns do exist, the numeric columns of a 
fact table are measure columns and measures are cre- 
ated for them 518. Once this is accomplished, all details 
are linked to dimensions 518. Hidden time dimensions 
inside fact tables are then detected for every detail 520. 
This includes time columns that span into time dimen- 
sions with an "(AH)-Y-Q-M hierarchy, Ji representing 
yearly, quarterly, and monthly, respectively. Natural and/ 
orvirtual hierarchies are then built for every dimension 
522. A determination is then made as to whether it is 
desired to enhance the cube structure 524, If not, the 
How ends 526. If it is desirable to enhance the cube 
structure, hidden hierarchies inside the dimension ta- 
bles are detected by analyzing interesting pairs (le, t 
pairs that are typically associated as occurring together) 
of dimension properties 528. ending the flow 526. 
[0032] The aforementioned flows are meant to be rep- 
resentative flows of various methods of the present in- 
vention. They in no way encompass every iteration and 
variance within the scope of the present invention. 
Those skilled in the art can appreciate that a method 
can incorporate modifications and stili remain within the 
purview of the present invention. 

[0033] The present Invention allows user interaction 
at any desired level. This permits a novice userto initiate 
an automated process with a single user action and also 
allows an expert user to intercede at will to fine tune a 
dimensional model for specific needs. One method of 
allowing this level of user interactivity is to provide a 
graphical style user interface, A user can then initiate, 
change, review, and/or augment the present invention 
easily. One skilled in the art can appreciate that a mul- 
titude of varying graphical interfaces are possible. As an 
example of just one possible interface of the present in- 
vention, an illustration of a graphical user interface set 
or "process" is described. Generally a user interface is 
comprised of at least one graphic, often a set of graph- 
ics, that is generated by a computing device and shown 
on a display for visual reference and interaction by the 
user This set of graphics is typically referred to as a 
"graphical user interface" (GUI) even though it is com- 
prised of more than one graphic. Thus ; components 
such as sub-graphics ; drop down menus and tables, se- 
lection devices, and text entry boxes and the like are all 
considered part of the graphical user interface. 
[0034] Likewise, the present invention also includes 
non-graphical user interfaces such as text based user 
interfaces. Although generally not as easy to interface 
with as a graphical interface, a text based interface can 
still be empioyed by the present invention to allow user 
interaction at any fevei and to also allow a single user 
action, such as a key stroke, to initiate an automated 
process. 

[0035] Turning to FIG. 6, a table of a graphical user 
Interface process 500 of interacting with a user in ac- 
cordance with an aspect of the present invention is il- 
lustrated. Th is process 800 illustrates an overview of dif- 



ferent graphical interfaces that can be employed In the 
present invention Examples of actual screen shots of 
these interfaces are shown in FIGs. 7-21 . In genera!, a 
welcome step 602 brings up a screen to welcome a user 
s such as a user interface 700 Illustrated in FIG. 7. This 
interface 700 explains the purpose of an aspect of the 
present invention and prepares a user for interacting 
with it A cube creation method interface step 604 then 
allows a user to select a manual track 606 or an auto- 
10 malic/semiautomatic tmck 60B (RDBMS/DW tracks, re- 
latronai database management system and data ware- 
house, respectively). A graphical user interface 800 In 
FIG. 8 depicts an example of such an interface. At this 
interface, a user can cnoese to accept suggestions (in- 

15 teract with the present invention) during a definition and 
cube building process. Typically, default is set to allow 
suggestions {i.e., allow a user to interact). 
[0036] If the automatic/semiautomatic track is cho- 
sen, the user is presented with a source selection capa- 

20 bility 610 like that shown in FIG 9's user interface 900 . 
The user can select a desired oata source to base the 
cube processing upon . Once a source has been select- 
ed, a user is presented with results from processing that 
identifies relationships and prrmary and foreign keys 

25 and suggests potential fact tables . dimension tables and 
hierarchies for those dimensions 612, Generally, the in- 
formation is provided to a user as the processing dis- 
covers them as illustrated in a graphical user interface 
1000 shown in FIG. 10, The process automatically de- 

^0 tects and suggests a table type for each of the tables of 
the selected data source 614. The user can then select 
only the tables required for creating the cube. Such an 
interface is depicted in a graphical user interface 1100 
shown in FIG. 11 . In other instances of the present m- 

35 vention (not shown), a diagram graphical user interface, 
as opposed to a table graphical user interface, can be 
presented to a user. 

[0037] After the tables are selected, a user can then 
select and add to the cube definition dimensions that 

40 were previously created 616 via a graphical user inter- 
face 1200 as shown in FIG. 12. The process 600 then 
takes information from an analysis and reports to the 
user suggestions for measure columns from the Dfevi- 
ously supplied data. The user can also refine this data, 

4$ for example, via a graphical user interface 1 300 depict- 
ed in FIG. 13, Typically measures are automatically 
grouped by a measure group function. This function 
groups the measures by topic and dimensional granu- 
larity. Generally, a default name of the grouping is a hu- 

50 man readable version of the table name (e.g., under- 
scoring is removed from the name). It is also possible 
to rename measure groups wathe graphical user inter- 
face 1300, The process 600 then provides a user with 
results from an analysis that creates dimensions and ht- 

55 erarchies 620. Such a graphical user interface 1400 Is 
depicted in FIG, 14. The user can stop the analysis at 
any time and proceed with just the hierarchies generat- 
ed up to thai point in time. The analysis detects hierar- 
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chies for all dimension tables that remain after a filtering 
generated by an existing dimension list. Ail dimension 
tables used by any existing dimension adcedto the cube 
are removed from a list of potential dimension tables to 
generate a dimension for If another dimension table 
finds itself isolated from aH fact tables mo direct link or 
no links through any other remaining dimension tables) > 
it is also removed from the dimension table iist. Once 
this process is completed, a dimension list is created 
from the remaining dimensions 

[0038} As an optional step ; a user can be provided 
with a graphical user interface to specify which column 
contains time interval periods fc a table defined as 
"time" in a previous selection 621. A graphical user in- 
terface 1500 illustrated in FIG. i shows an example of 
how this can be presented to a ^ser. A user is then pre- 
sented with a graphical user interface to relay informa- 
tion from an analysis which a>. romatically generates hi- 
erarchies and properties for a new dimension 824. The 
user can refine these parameters in an interface such 
as a graphical user interface 1600 shown in FIG. 16. 
The user has a capability co create additional dimen- 
sions and! or to edit ones automatically supplied by the 
analysis. Once all param iters are satisfied for the anal- 
ysis, the user is present with a graphical user inter- 
face to finalize the cub: 626. For example, a graphical 
user interface 1 700 show in FIG. 17 aJiows the user to 
name the cube, review its structure and save the cube, 
[0039] The above discussion concerns a user who de* 
sires to interact with an automated process. However 
a user can also manually construct a cube and is still 
supported by graphical user interfaces as noted in the 
process 600 shown in FIG. 6. The manual track 606 in- 
cludes interfaces 628 comprised of a creating a new 
manual measure definition interface, a selecting exist- 
ing dimensions -iterface, a creating new manual dimen- 
sion definitions nterface, and an optional defining a time 
dimension interface. Typical graphical user interfaces 
are depicted in FIGs. 18-21 denoted by graphical user 
interfaces 1 800-21 00 respectively. Thus, the present in- 
vention allows a user to have great flexibility as to a level 
of involvement and skill required by a user. This allows 
a tremendous advantage by employing the present in- 
vention. 

[0040] r hus. one aspect of the present invention is 
comprised of an interface adapted to communicate with 
an automated data analysis system, at least one output 
associated with the interface to provide indications of 
data processing withm the data analysis system relating 
to at east one characteristic, and at teast one input to 
influence the data processing based, at \easi in part, on 
a user's preference of how to define a dimensionai mod- 
el and? or how to construct a dimensionai model. In an- 
other aspect of the present invention, the automated da- 
ta analysis system is comprised of an automated struc- 
ture labeling system utilizing simplified analysis heuris- 
tics for defining a dimensional model based on data in- 
terrelations from a relational data schema and an auto- 



mated model building system for constructing the di- 
mensional modei defined by the automated structure la- 
beling system. In yet another instance of the present 
invention, the output is comprised of dimensions, at- 

5 tributes, aggregate functions, table types, measures, 
and/or measure groups. In still yet another instance of 
the present invention, the input is comprised of dimen- 
sions, attributes, aggregate functions s table types, 
measures, and/or measure groups. In yet other instanc- 

io es of the present invention is further comprised of an 
input to initiate via a single user action such as the au- 
tomated structure labeling system and/ or the automated 
model buifding system. 

[0041 ] In order to provide additional context for imple- 

1$ meriting various aspects of the present invention, FIG. 
22 and the following discussion is intended to provide a 
brief, general description of a suitable computing envi- 
ronment 2200 in which the various aspects of the 
present invention may be implemented. While the inven- 

20 tion has been described above in the genera! context of 
computer-executable instructions of a computer pro- 
gram that runs on a local computer and/or remote com- 
puter, those skilled in the art will recognize that the in- 
vention also may be implemented in combination with 
otherprogram modules. Generally, program modules in- 
clude routines, programs, components, data structures. 
etc. that perform particular tasks and/or implement par- 
ticular abstract data types. Moreover, those skilled in the 
art will appreciate that the inventive methods may be 

30 practiced with other computer system configurations, in- 
cluding single-processor or multi-processor computer 
systems, minicomputers, mainframe computers, as well 
as personal computers, hand-held computing devices, 
microprocessor-based and/ or programmable consumer 

35 electronics, and the like, each of which may operatively 
communicate with one or more associated devices. The 
illustrated aspects of the invention may also be prac- 
ticed in distributed computing environments where cer- 
tain tasks are performed by remote processing devices 

40 that are linked through a communications network. 
However, some, if not all, aspects of the invention may 
be practiced on stand-alone computers, in a distributed 
computing environment, program modules may be lo- 
cated In Jocal and/or remote memory storage devices. 

45 [0042] As used in this application, the term "compo- 
nent" is intended to refer to a computer-related entity, 
either hardware, a combination of hardware and soft- 
ware, software, or software in execution. For example, 
a component may be, but is not limited to. a process 

so running on a processor a processor an object, an exe- 
cutable : a thread of execution, a program, and a com- 
puter. By way of illustration, an application running on a 
server and/or the server can be a component, in addi- 
tion, a component may include one or more subcompo- 

$5 nents. 

[0043] With reference to FIG. 22. an exemplary sys- 
tem environment 2200 for implementing the various as- 
pects of the invention includes a conventional computer 
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2202. including a processing unit 2204. a system mem- 
ory 2206, and a system bus 2208 that couples various 
system components, including the system memory, to 
the processing unit 2204. The processing unit 2204 may 
be any commercially available or proprietary processor 
In addition, the processing unit may be implemented as 
multiprocessor formed of more than one processor 
such as may be connected in parallel, 
[0044] The system bus 2208 may be any of several 
types of bus structure including a memory bus or mem- 
ory controller, a peripheral bus, and a local bus using 
any of a variety of conventional bus architectures such 
as PCI. VESA, MicroChannel, ISA, and EISA, to name 
a few. The system memory 2206 includes read only 
memory (ROM) 2210 and random access memory 
(RAM) 2212. A basic input/output system (BIOS) 2214, 
containing the basic routines that help to transfer infor- 
mation between elements within the computer 2202, 
such as during start-up, is stored in ROM 2210. 
[0045] The computer 2202 also may include, for ex- 
ample, a hard disk drive 221 6, a magnetic disk drive 
2218, e.g., to read from or write to a removable disk 
2220 ; and an optical disk drive 2222 : e.g., for reading 
from or writing to a CD-ROM disk 2224 or other optical 
media. The hard disk drive 2216, magnetic disk drive 
221 8 and optical disk drive 2222 are connected to the 
system bus 2208 by a hard disk drive interface 2226, a 
magnetic disk drive interface 2228. and an optical drive 
interface 2230, respectively. The drives 2216-2222 and 
their associated computer-readable media provide non- 
volatile storage of data, data structures, computer-exe- 
cutable instructions, etc. for the computer 2202. Al- 
though the description of computer-readable media 
above refers to a hard disk, a removable magnetic disk 
and a CD, it should be appreciated by those skilled in 
the art that other types of media which are readable by 
a computer such as magnetic cassettes, flash memory 
cards, digital video disks, Bernoulli cartridges, and the 
like, can also be used in the exemplary operating envi- 
ronment 2200. and further that any such media may 
contain computer-executable instructions for perform- 
ing the methods of the present invention. 
[0046] A number of program modules may be stored 
in the drives 2216-2222 and RAM 221 2 S including an 
operating system 2232, one or more application pro- 
grams 2234, other program modules 2236, and program 
data 2238. The operating system 2232 maybe any suit- 
able operating system or combination of operating sys- 
tems. By way of example, the application programs 
2234 and program modules 2238 can include a data- 
base analysis system and/or an interactive dimensional 
model building system that utilizes data in accordance 
with an aspect of the present invention. Additionally, the 
program data 2238 can include input data for controlling 
andfcr biasing a dimensional model in accordance with 
an aspect of the present mvention. 
[0047] A user can enter commands and information 
into the computer 2202 through one or more user input 



devices, such as a keyboard 2240 and a pointing device 
(e g . a mouse 2242). Other input devices {not shown) 
may include a microphone, a joystick, a game pad. a 
satellite dish f wireless remote, a scanner or the like. 

5 These and other input devices are often connected to 
the processing unit 2204 through a serial port interface 
2244 that is coupled to the system bus 2208. but may 
be connected by other interfaces, such as a parallel port, 
a game port or a universai serial bus (USB). A monitor 

to 2246 or other type of dispiay device is also connected 
to the system bus 2208 via an interface, such as a video 
adapter 2248, in addition to the monitor 2246, the com- 
puter 2202 may include other peripheraf output devices 
(not shown), such as speakers, printers, etc, 

'5 [0048] It is to be appreciated that the computer 2202 
can operate in a networked environment using logical 
connections to one or more remote computers 2260, 
The remote computer 2260 may be a workstation, a 
server computer, a router a peer device or other com- 

20 men network node, and typically includes many or all of 
the elements described relative to the computer 2202, 
although , for purposes of brevity, only a memory storage 
device 2262 is illustrated in FIG, 22. The logical connec- 
tions depicted in FIG, 22 can include a local area net- 

25 work (LAN) 2264 and a wide area network (WAN) 2266 , 
Such networking environments are commonplace in of- 
fices, enterprise-wide computer networks, intranets and 
the Internet, 

[0049] When used in a LAN networking environment. 

30 for example, the computer 2202 is connected to the local 
network 2264 through a network interface or adapter 
2268. When used in a WAN networking environment, 
the computer 2202 typically includes a modem (e.g., tel- 
ephone, DSL. cable, etc.) 2270, or is connected to a 

35 communications server on the LAN, or has other means 
for establishing communications over the WAN 2266, 
such as the Internet. The modem 2270, which can be 
internal or external relative to the computer 2202, is con- 
nected to the system bus 2208 via the serial port inter- 

40 face 2244. In a networked environment, program mod- 
ules (including application programs 2234} and/or pro- 
gram data 2238 can be stored in the remote memory 
storage device 2262, It will be appreciated that the net- 
work connections shown are exemplary and other 

45 means [ag,,, wired or wireless) of establishing a com- 
munications Jink between the computers 2202 and 2260 
can be used when carrying out an aspect of the present 
invention. 

[0050] In accordance with the practices of persons 
so skilled in the art of computer programming, the present 
invention has been described with reference to acts and 
symbolic representations of operations that are per- 
formed by a computer such as the computer 2202 or 
remote computer 2260,. unless otherwise indicated, 
55 Such acts and operations are sometimes referred to as 
being computer-executed, it will be appreciated that the 
acts and symbolically represented operations include 
the manipulation by the processing unit 2204 of electri- 
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cal signals representing data bits which causes a result- 
ing transformation or reduction of the electrical signal 
representation, and the maintenance of data bits at 
memory locations in the memory system (including the 
system memory 2206, hard drive 2216, floppy disks 5 
2220, CD-ROM 2224. and remote memory 2262) to 
thereby reconfigure or otherwise aiterthe computer sys- 
tem's operation, as well as other processing of signals. 
The memory locations where such data bits are main- 
tained are physical locations that have particular elec- 10 
tricaf, magnetic, or optical properties corresponding to 
the data bits. 

[0051] FIG. 23 is another block diagram of a sample 
computing environment 2300 with which the present in- 
vention can interact. The system 2300 further illustrates 15 
a system that includes one or more client(s) 2302. The 
client(s) 2302 can be hardware and/or software [e.g., 
threads, processes, computing devices). The system 
2300 also includes one or more server{s) 2304, The 
server(s) 2304 can also be hardware and/or software 2Q 
{e~g,, threads, processes, computing devices). The 
servers 2304 can house threads to perform transforma- 
tions by employing the present invention for example. 
One possibie communication between a client 2302 and 
a server 2304 may be in the form of a data packet adapt- 25 
ed to be transmitted between two or more computer 
processes. The system 2300 includes a communication 
framework 2308 that can be employed to facilitate com- 
munications between the client(s) 2302 and the server 
(s) 2304. The ciient(s) 2302 are operably connected to 30 
one or more client data store(s) 231 0 that can be em- 
ployed to store information local to the client(s) 2302. 
Similarly, the server(s) 2304 are operably connected to 
one or more server data store(s) 2306 that can be em- 
ployed to store information local to the servers 2304, 35 
[0052J !n one instance of the present invention, a data 
packet transmitted between two or more computer com^ 
ponents that facilitates data analysis is comprised of di- 
mensional model analysis data, based, in part, on data 
from an automated structure labeling system utilizing 
simplified analysis heuristics for defining a dimensional 
model (such as an OLAP object a ROLAP object, and 
a MOLAP object and the like) based on data interrela- 
tions from a relational data schema (such as a relational 
database), data from an automated model building sys- 4$ 
tern for constructing the dimensional mode^ defined by 
the automated structure labeling system, data from an 
input to initiate via a single user action (such as data 
from a computer mouse click) such as the automated 
structure labeling system and/or the automated modei 50 
building system, data from an output associated with an 
Interface (such as data for a graphical user interface) to 
provide indications of data processing of the data anal- 
ysis, and/or data from an input associated with an inter- 
face to influence data processing based, at least in part. 
on a user's preference of how tc define a dimensional 
model and/ or how to construct a dimensional modei. 
[0053] In another instance of the present invention, a 



computer readable medium storing computer executa- 
ble components of a system for facilitating data analysis 
is comprised of an automated data analysis system that 
provides information associated with a data set, based, 
at least in part, upon data from an automated structure 
labeling system utilizing simplified analysis heuristics for 
defining a dimensional model {such as an OLAP object, 
a ROLAP object, and a MOLAP object and the like) 
based on data interrelations from a relational data sche- 
ma (such as a relational database), data from an auto- 
mated mode! building system for constructing the di- 
mensional modei defined by the automated structure la- 
beling system, data from an input to initiate via a single 
user action (such as data from a computer mouse click) 
such as the automated structure labeling system and/or 
the automated model building system, data from an out- 
put associated with an interface (such as data for a 
graphical user interface) to provide indications of data 
processing of the data analysis, and/or data from an in- 
put associated with an interface to influence data 
processing based, at least in part, on a user's prefer- 
ence of how to define a dimensional model and/ or how 
to construct a dimensional model. 
[0054] It is to be appreciated that the apparatus, sys- 
tems and/ or methods of the present invention can be 
utmzed in a data analysis scheme facilitating computer 
components and non-computer related components 
alike. Further, those skilled in the art will recognize that 
the apparatus, systems and/or methods of the present 
invention can be employed in a vast array of electronic 
related technologies, including, but not limited to. com- 
puters, servers and/or handheld electronic devices and 
the like. 

[0055] What has been described above includes ex- 
amples of the present invention. It is, of course, not pos- 
sibie to descnbe every conceivable combination of com- 
ponents or methodologies for purposes of describing 
the present invention, but one ol ordinary skill in the art 
may recognize that many further combinations and per- 
mutations of the present invention are possibie. Accord- 
ingly, the present invention is intended to embrace ail 
such alterations, modifications and variations that fall 
within the spirit and scope of the appended claims. Fur- 
thermore, to the extent that the term "includes" is used 
in either the detailed description orthe claims, such term 
is intended to be inclusive in a manner similar to the term 
"comprising 35 as "comprising" is interpreted when em- 
ployed as a transftionai word in a claim. 



Claims 

1. A data analysis system, comprising: 

an automated structure labeling system utiliz- 
ing simplified analysis heuristics for defining a 
dimensional mode! based on data interrelations 
from a relational data schema: and 
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an automated model building system for con- 
structing the dimensional model defined by the 
automated structure labeling system. 

2. The system of ciaim 1 , further comprising: 

a user interface component that allows a user 
to Interact with at ieast one selected from the 
group consisting of the automated structure la- 
beling system and the automated model build- 
ing system, 

3. The system of claim 2. the user interface compo- 
nent comprising a graphical user interface. 

4. The system of claim 1 , further comprising; 

an input component that allows a user to initiate 
via a single user action at least one selected 
from the group consisting of the automated 
structure labeling system and the automated 
model buildmg system. 

5. The system of claim 4, the input component com- 
prising a single computer mouse click. 

6. The system of claim 1 , the relational data schema 
comprising a relational database, 

7. The system of claim 1 ; the dimensional model com- 
prising at least one selected from the group consist- 
ing of an OLAP object, a ROLAP object, and a MO- 
LAP object. 

8. A data analysis method, comprising: 

defining a dimensional model automatically 
based on data interrelations from a relational 
data schema utilizing simplified analysis heu- 
ristics. 

9. The method of claim 8 : further comprising: 

building the dimensional model automatically 
after defining the dimensional model. 

10. The method of claim 8, further comprismg: 

providing a means to adow a user to initiate via 
a single user action the defining of the dimen- 
sional model. 

11. The method of claim 9. further comprising: 

providing a means to ailow a user to initiate via 
a single user action at least one selected from 
the group consisting of the defining of the di- 
mensional model, the building of the dimen- 
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sionai model and both the defining and building 
of the dimensional model 

12. The method of claim 8 : further comprismg: 

providing at least one structure and at ieast one 
characteristic of the dimensional model interac- 
tively to a user 

13. The method of claim 12, the structure comprising at 
ieast one selected from the group consisting of at 
least one fact table, at ieast one degenerate table, 
and at least one dimension table. 

14. The method of claim 12, the characteristic compris- 
ing at least one selected from the group consisting 
of at least one dimension, at least one attribute, at 
ieast one measure, and at least one measure 
group. 

15. The method of claim 8, further comprising: 

employing user inputs to facilitate in defining 
the dimensional model. 

16. The method of claim 1 5 the user inputs comprising, 
at least m part, a user interaction level comprised 
of at least one selected from the group consisting 
of total automation limited interaction, and full in- 
teraction modes. 

17. The method of claim 9 5 further comprising: 

employing user inputs to facilitate in building 
the dimensional model. 

18. The method of claim 17. the user inputs comprismg. 
at least in pari a user interaction level comprised 
of at ieast one selected from the group consisting 
of total automation, limited Interaction, and full in- 
teraction modes. 

19. The method of cla^m 8 ; the refationaf data schema 
comprising a relational database. 

20. The method of claim 8. the defining of the dimen- 
sional model comprising: 

labeling at least one structure of the dimension- 
al model as at least one selected from the group 
consisting of a strong fact, a strong dimension, 
both a fact and a dimension, a dimension, and 
a fact. 

21 . The method of claim 20, the structure comprising a 
table. 

22. The method of claim 20 ? the labeling based on at 
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least one selected from the group consisting of: 

strong facts comprising structures with only out 
arcs; 

strong dimensions comprising at least one se- 
lected from the group consisting of structures 
with at least two in arcs, structures with one in 
arc from a dimension, and structures with one 
in arc from a fact structure and at least one se- 
lected from the group consisting of zero and 
one out arcs; 

both fact and dimension comprising structures 
with one arc in from a fact structure and at least 
two arcs into a cluster containing dimensions 
labeled by starting from other fact structures; 
and 

dimensions comprising structures with one in 
arc from a fact structure. 

23. The method of claim 20 , further comprising: 

determining clusters of partitions by identifying 
structures having equivalent labeling for vor- 
texes with only out arcs. 

24. The method of claim 23, the equivalent labeling 
comprising identical primary key and identical car* 
dinalities of columns with measure types not used 
in a foreign key/primary key pair. 

25. The method of claim 23, further comprising: 

grouping the partitions in detail clusters and an- 
alyzing as a single vertex. 

26. The method of claim 9 ; the building the dimensional 
model comprising: 

creating as many details as fact clusters; 
storing a tact structure name in a detail in which 
it resides; 

defining numeric columns inside a detail of a 
fact structure as measure columns and creating 
measures for the measure columns; 
creating a count measure when no numeric col- 
umns exist inside a detail of a fact structure; 
inking details to dimensions; 
detecting hidden time dimensions inside a fact 
structure: 

naming a dimensional model based on a detail 
having more measures than any other detail; 
and 

building at least one selected from the group 
consisting of natural hierarchies and virtual hi- 
erarchies for at Jeast one dimension. 

27. The method of claim 8 . the dimensional model com- 
prising at ieast one selected from the group consist- 



ing of an OLAP object, a ROLAP object, and a MO- 
LAR object, 

28. A data analysis system, comprising: 

5 

means for utHi2ing simplified analysis heuristics 
for defining a dimensional model based on data 
interrelations from a relational data schema; 
and 

to means for building the dimensional model de- 

fined by the automated structure labeling sys- 
tem. 

29. The system of claim 28, further comprising: 

15 

means to allow a user to interact with at least 
one selected from the group consisting of the 
automated structure labeling system and the 
automated model building system. 

20 

30. The system of claim 28, further comprising: 

means for allowing a user to initiate with a sin- 
gle user action at least one selected from the 
25 group consisting of the automated structure la- 

beling system and the automated model build- 
mg system, 

31. A user interface, comprising: 

30 

an interface adapted to communicate with an 
automated data analysis system: 
at least one output associated with the interface 
to provide indications of data processing within 
35 the data analysis system relating to at least one 

characteristic: and 

at least one input to influence the data process- 
ing based, at least in part, on a user's prefer- 
ence of at least one selected from the group 
40 consisting of a how to define a dimensional 

model and how to construct a dimensional 
model. 

32. The interface of claim 31 . the automated data anal- 
45 ysis system comprising at least one selected from 

the group consisting of : 

an automated structure labeling system utiliz- 
ing simplified analysis heunstics for defining a 
50 dimensional mode! based on data interrelations 

from a relational data schema; and 
an automated model building system for con- 
structing the dimensional model defined by the 
automated structure labeling system. 

55 

33. The interface of claim 31, the output comprising at 
least one selected from the group consisting of di- 
mensions, attributes, aggregate functions, table 



25 



30 



35 
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types, measures, and measure groups. 

34. The interface of claim 31, the input comprising at 
least one selected from the group consisting of di- 
mensions, attributes, aggregate functions, table 5 
types, measures, and measure groups. 

35. The interface of daim 32, further comprising an in- 
put to initiate via a singie user action at least one 
selected from the group consisting of the automated to 
structure labeling system and the automated model 
building system. 

36. The interface of daim 31 , the interface comprising 

at [east one selected from the group consisting of a is 
graphical user interface and a text based interface, 

37. A data packet transmitted between two or more 
computer components that facilitates data analysis, 

the data packet comprising dimensional model 20 
analysis data, based, in part, on at least one select- 
ed from the group consisting of: 

data from an automated structure labeling sys- 
tem utilizing simplified analysis heuristics for 2s 
defining a dimensional model based on data in- 
terrelations from a relational data schema; 
data from an automated model building system 
for constructing the dimensional model defined 
by the automated structure labeling system: 30 
data from an input to initiate via a single user 
action at feast one selected from the group con- 
sisting of the automated structure labeling sys- 
tem and the automated model building system; 
data from an output associated with an inter- 35 
face to provide indications of data processing 
of the data analysis; and 
data from an input associated with an interface 
to influence data processing based. 

40 

at least in part ; on a user's preference of at least 
one selected from the group consisting of a how to 
define a dimensional model and how to construct a 
dimensionai model 

45 

38. The data packet of claim 37, the input to initiate via 
the single user action comprising data from a com- 
puter mouse click, 

39. The data packet of daim 37. the output associated 5a 
with the interface comprising data for a graphicai 
user interface. 

40. The data packet of daim 37 : the relational data 
schema comprising a relational database. ss 

41 . The data packet of claim 37, the dimensional model 
comprising at least one selected from the group 



consisting of an OLAP object, a ROLAP object, and 
a MOLAP object. 

42. A computer readable medium storing computer ex- 
ecutable components of a system for facilitating da- 
ta analysis, comprising an automated data analysis 
system that provides information associated with a 
data set, based ; at least in part, upon at least one 
selected from the group consisting of: 

an automated structure labeling system utiliz- 
ing simplified analysis heuristics for defining a 
dimensional model based on data interrelations 
from a relational data schema: 
an automated model building system for con- 
structing the dimensional model defined by the 
automated structure labeling system; 
an input to initiate via a single user action at 
least one selected from the group consisting of 
the automated structure labeling system and 
the automated mode! building system; 
an output associated with an interface to pro- 
vide indications of data processing of the auto- 
mated data analysis system; and 
an input associated with an interface to influ- 
ence automated data processing based, at 
least in part on a user's preference of at least 
one selected from the group consisting of a how 
to define a dimensional model and how to con- 
struct a dimensional model, 

43. The medium of claim 42, the input to initiate Wathe 
single user action comprising data from a computer 
mouse click. 

44. The medium of claim 42 : the output associated with 
the interface comprising data for a graphical user 
interface. 

45. The medium of claim 42, the relational data schema 
comprising a relational database. 

48. The medium of claim 42, the dimensional model 
comprising at least one selected from the group 
consisting of an OLAP object., a ROLAP object, and 
a MOLAP object. 

47. A device employing the method of claim 8 compris- 
ing at least one from a group consisting of a com- 
puter, a server and a handheld electronic device. 

48. A device employing the system of claim 1 compris- 
ing at least one from a group consisting of a com- 
puter, a server and a handheld electronic device. 
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